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VOLTAGE ID BASED FREQUENCY CONTROL FOR CLOCK 
GENERATING CIRCUIT 

FIELD 

The present invention relates generally to microprocessor circuits, and more 
specifically to internal clocks in microprocessor. 

BACKGROUND 

Electronic devices, such as microprocessors, are steadily operating at faster 
and faster speeds. As microprocessors run at higher and higher speeds, the power 
delivered to the microprocessors by a power supply starts to become an issue. 
Voltage drops (or droops) may occur as power is delivered from a power source to 
individual components and devices on the die of a microprocessor. For example, 
devices on a die may receive only 1 .0 volt from a power source that is supplying 1 .2 
volts due to a voltage droop. Decoupling capacitors may be used on a die to help 
reduce voltage droop. However, decoupling capacitors cost area on the die and 
also cost power due to gate oxide leakage. 

Power source voltage droops affect the speed at which an electronic device 
(e.g., microprocessor or integrated circuit) may operate. During normal operation of 
a microprocessor (or any sequential machine), noise may be generated from 
instantaneous switching. Voltage supply noise modulates the delay of data paths. 
Voltage droops reduce the maximum frequency of operation of the microprocessor. 
For example, as a voltage droop magnitude increases, the operating frequency of 
the microprocessor decreases. Thus, a large change in processor activity may 
cause substantial supply voltage transients resulting in performance loss. 



219 .41409X00 
P14297 



BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing and a better understanding of the present invention will 
become apparent from the following detailed description of example embodiments 
5 and the claims when read in connection with the accompanying drawings, all forming 
a part of the disclosure of this invention. While the foregoing and following written 
and illustrated disclosure focuses on disclosing example arrangements and 
embodiments of the invention, It should be clearly understood that the same is by 
way of illustration and example only and that the arrangements and embodiments 
10 are not limited thereto. 

The following represents brief .descriptions of the drawings in which like 
reference numerals represent like elements and wherein: 

FIG. 1 is a block diagram of an integrated circuit according to one 
arrangement; 

15 FIG. 2 is a schematic diagram of a clock distribution network according to one 

arrangement; 

FIG. 3 is a diagram of one stage of a clock distribution network according to 
one arrangement; 

FIG. 4 is a block diagram of a clock generating circuit according to an 
2 0 example embodiment of the present invention; 

FIG. 5 is a diagram of a clock generating circuit according to an example 
embodiment of the present invention; 
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FIG. 6 is a diagram showing the clock generating circuit of FIG. 5 along with 
elements to be clocked according to an example embodiment of the present 
invention; 

FIG. 7 is a diagram of a clock generating circuit according to an example 
embodiment of the present invention; 

FIG. 8 is a diagram of a clock generating circuit according to an example 
embodiment of the present invention; 

FIG. 9 is a diagram of a clock generating circuit according to an example 
embodiment of the present invention; 

FIG. 10 is a diagram of a start/stop circuit according to an example 
embodiment of the present invention; 

FIG. 1 1 is a graph showing a Vcc voltage signal that varies due to noise; 

FIG. 12 is a diagram showing a voltage ID mechanism according to an 
example embodiment of the present invention; and 

FIG. 13 is a graph showing Vcc and temperature performance over time. 

DETAILED DESCRIPTION 

In the following detailed description, like reference numerals and characters 
may be used to designate identical, corresponding or similar components in differing 
figure drawings. Further, In the detailed description to follow, example values may 
be given, although embodiments of the present invention are not limited to the 
same. While values may be described as HIGH or LOW, these descriptions of 
HIGH and LOW are intended to be relative to the discussed arrangement and/or 
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embodiment. That is, a value may be described as HIGH in one arrangement 
although it may be LOW if provided in another arrangement. Arrangements and 
embodiments may be shown in block diagram form in order to avoid obscuring the 
invention, and also in view of the fact that specifics with respect to implementation of 
such block diagram arrangements and embodiments may be highly dependent upon 
the platform within which the present invention is to be implemented. That is, such 
specifics should be well within the purview of one skilled in the art. Where specific 
details (e.g., circuits) are set forth in order to describe example embodiments of the 
invention, it should be apparent to one skilled in the art that the invention can be 
practiced without, or with variation of, these specific details. It should also be 
apparent that differing combinations of hard-wired circuitry may be used to 
implement embodiments of the present invention. That is, embodiments of the 
present invention are not limited to any specific combination of hardware. 

Embodiments of the present invention may also be described with respect to 
signals being input or output from different circuit components. It is understood that 
while the discussion identifies a signal, the signal may be transmitted over a signal 
line or similar type of mechanism. Further, the terminology signal may also 
correspond to a signal line as shown in the drawings. Well-known power/ground 
and address connections to components may not be shown within the figures for 
simplicity of illustration and discussion, and so as not to obscure the invention. 

While the following discussion may be presented with respect to 
implementation in a microprocessor, embodiments of the present invention are not 
limited to that specific implementation. Implementations for generating clock signals 
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for various digital devices such as integrated circuits, discrete logic devices, memory 
devices, devices either on the same or separate chips, communications devices, 
etc., are also within the scope of the present Invention. 

In order to better describe embodiments of the present invention being 
5 pursued in the present patent application, various embodiments and arrangements 
may Initially be described. These initial embodiments may be pursued in related 
U.S. Patent Application No. (Attorney Docket No. 219.41237X00), filed 
simultaneously with this application. 

Embodiments of the present Invention may provide a circuit that includes a 
10 clock distribution network and a multiplexing device coupled to the clock distribution 
network to select between a synchronous mode and an asynchronous mode. A 
plurality of distributed ring oscillators may asynchronously drive the clock distribution 
network in the asynchronous mode. The distributed ring oscillators may be coupled 
to a power supply such that they track the critical paths. A phase lock loop circuit 
15 (located externa! to the core circuit) may synchronously drive the clock distribution 
network in the synchronous mode. In the following discussion, the terminology 
asynchronously driving the clock distribution network may be used with reference to 
the asynchronous mode. In the asynchronous mode, the clock distribution network 
may be driven asynchronously relative to an external clock. The clock distribution 
2 0 network may be synchronously driven although it is asynchronous with respect to an 
external clock. 

Embodiments of the present invention may thereby provide power supply 
control on a microprocessor. This allows performance to be recovered since the 
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performance may be dependent on the average power supply level rather than 
minimizing power supply droop. The core clock frequency may instantaneously 
track the worst-case speedpath over Vcc noise. The instantaneous performance of 
the processor core may vary over time in response to Vcc transient. 

FIG. 1 is a block diagram of an integrated circuit according to one 
arrangement. Other arrangements are also possible. More specifically, FIG. 1 
shows an integrated circuit 100 having a core 101, an interface 105, and a clock 
generator 102. The core 101 may include circuitry and logic to perform the 
designated functions of the Integrated circuit, while the interface 105 may provide an 
interface between the core 101 and the remainder of the system and Its system 
bus(es). For instance, if the integrated circuit 100 includes a processor, the core 101 
may Include one or more decoders, scheduling logic, execution units, reorder 
buffers, memory order buffers, register files, cache memory, etc., for use in 
executing instructions. The interface 105 may Include external bus controller logic 
and programmable interrupt controller logic. 

The clock generator 102 may generate the clock signals in response to a 
system clock signal 110. The clock generator 102 may include a phase lock loop 
(PLL) circuit. The clock signals may be coupled to the core 101 and the interface 
105. The clock generator 102 may generate the bus clock signal(s) 103 and the core 
clock slgnal(s) 104. 

FIG. 2 illustrates a clock distribution network 200 according to one 
arrangement. Other arrangements are also possible. The clock distribution network 
200 distributes a clock signal to chip components such as the core 101 (shown In 
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FIG. 1). As illustrated, a feedback clock signal and a reference clock signal may be 
applied to a PLL 210, which may be provided within the clock generator 102. The 
clock distribution network 200 may include a plurality of drivers 220, 230, 240, 250, 
260, 270 and 280 to drive large capacitances, such as attributable to registers and 
latches, with the output signal of the voltage-controlled oscillator of the phase lock 
loop (PLL). The drivers 220, 230, 240. 250, 260, 270 and 280 may contain inverters 
(not shown). Hence, the capacitances may be switched at the clock frequency. The 
capacitances 265, 275, 285, 290, 295 and 255 may be the capacitances attributable 
to the components of the chip. In addition to these capacitances, the gate 
capacitances of the driver inverters may also be switched at the clock frequency. If 
the total capacitance for the clock network is represented as C and the clock 
network switches at the clock frequency, f, the amount of power dissipated may be 
represented as CV^f, where V is the supply voltage. This amount of power may be a 
significant portion of the total power utilized by the chip due to a relatively large C 
and a relatively high f. 

FIG. 3 is a diagram of one stage of a clock distribution network according to 
one arrangement. Other arrangements are also possible. More specifically, FIG. 3 
shows one stage of a clock distribution network 300 that may be provided within the 
core 101 (FIG. 1). FIG. 3 shows the terminal stage with multiple drivers driving a 
common and continuous grid. The clock distribution network 300 may also be 
provided within other entities. The core clock distribution network 300 may include 
signal traces 310 (shown vertically in the drawing figure) and signal traces 320 
(shown horizontally in the drawing figure). The core clock distribution network 300 
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may also include a plurality of drivers coupled to the signal traces 310 and 320. For 
ease of illustration, only a first driver 330 is labeled in FIG. 3. The drivers operate to 
provide clock signals (such as core clock signals) to the clock distribution network 
300 and thereby provide clock signals to respective elements (such as latches and 
registers) of the core 101 (not shown in FIG. 3). 

FIG. 4 is a block diagram of a clock generating circuit according to an 
example embodiment of the present invention. Other embodiments and 
configurations are also within the scope of the present invention. In this 
embodiment, the clock generating circuit may be provided within the core 101 
although the clock generating circuit may also be located in other locations that 
include a clock distribution network. More specifically, FIG. 4 shows a clock 
generating circuit 400 coupled between a first signal trace 312 and a second signal 
trace 322. The first signal trace 312 may be one of the signal traces 310 and the 
second signal trace 322 may be one of the signal traces 320. However, 
embodiments of the present invention are also applicable to the signal trace 312 and 
the signal trace 322 being configured into a clock distribution network different from 
the clock distribution network shown in FIG. 3. FIG. 4 only shows one clock 
generating circuit although the plurality of clock generating circuits may be 
distributed throughout the clock distribution network. 

FIG. 4 also shows a power supply (or power supply device) 350 provided 
external to the core 101, such as in a periphery (i.e., the I/O) of the integrated circuit 
about the core 101 (FIG. 1). The power supply 350 may be coupled by a power 
distribution network 355 to the clock generating circuit 400. Although not shown in 
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FIG. 4, a plurality of clock generating circuits 400 may be provided about the clock 
distribution network in a similar manner as each of the drivers provided about the 
clock distribution network 300 in FIG. 3. In other words, the clock distribution 
network may include a plurality of clock generating circuits 400 each coupled 
5 between signal traces and each powered by the power supply 350 through the 
power distribution network 355. 

FIG. 5 is a diagram of a clock distribution circuit according to an example 
embodiment. FIG. 5 shows more specific circuit elements (such as inverter circuits 
and a multiplexing device) each of which may be powered by the power supply 350 

10 coupled via the power distribution network 355. As shown, a plurality of inverter 
circuits 410, 420, 440, 450 and 460 and a multiplexing device 430 (or selecting 
device) may be coupled as a ring oscillator (or ring oscillator circuit) between the 
signal trace 312 and the signal trace 322. That is, the inverter circuit 410 receives a 
signal from the signal trace 322. The signal propagates through the inverter circuit 

15 420, through the multiplexing device 430, through the inverter circuits 440, 450 and 
460 and is output to the signal trace 312 from the last stage of the ring oscillator 
(such as the inverter circuit 460). In other words, the input signal to the ring 
oscillator and the output signal of the ring oscillator are to the clock distribution 
network. 

2 0 FIG. 5 shows five stages of a ring oscillator circuit coupled between signal 

traces. The ring oscillator circuit may include any odd number of stages so as to 
produce an oscillating circuit between signal traces. That is, while FIG. 5 shows five 
inverter circuits, embodiments of the present application are also applicable to other 
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numbers of inverter circuits coupled in series so as to produce a ring oscillator 
circuit. Furthermore, circuit elements other than inverter circuits may also be used to 
form the ring oscillator. 

The multiplexing device (or selecting device) 430 is coupled between the 
5 inverter circuit 420 and the inverter circuit 440. The multiplexing device 430 selects 
between inputs on a signal line 432 and a signal line 434. The multiplexing device 
430 may receive an input signal on the signal line 434 from the inverter circuit 420. 
The multiplexing device 430 may receive input signals on the signal line 432 from a 
phase lock loop (PLL) circuit provided external to the core 101, for example. That is, 

10 the phase lock loop circuit may provide a clock signal along the signal line 432 to the 
multiplexing device 430. 

The multiplexing device 430 may receive a select signal to select between an 
asynchronous mode and a synchronous mode. In the synchronous mode, the clock 
signal on the signal line 432 passes through the multiplexing device 430, and 

15 subsequently passes through the Inverter circuits 440, 450 and 460. The resulting 
signal is output to the signal trace 312 (i.e., the clock distributing network). This 
thereby results in the clock distribution network operating based on a synchronous 
signal provided by the PLL located external to the core 101 . On the other hand, the 
multiplexing device 430 may operate in the asynchronous mode based on the select 

20 signal. In the asynchronous mode, the signal on the signal line 434 (from the 

inverter circuit 420) passes through the multiplexing device 430 and subsequently 
passes through the inverter circuits 440, 450 and 460. The resulting signal is output 
to the signal trace 312 (i.e., the clock distribution network). The signal may also 
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propagate back along the signal trace 322 to the inverter circuits 410 and 420 and 
be subsequently passed through the multiplexing device 430 since the multiplexing 
device 430 is still operating in the asynchronous mode. Accordingly, the select 
signal applied to the multiplexing device 430 may operate the clock generating 
5 circuit in either a synchronous mode or an asynchronous mode. Although not shown 
in FIG. 5, each of the clock generating circuits provided about the clock distribution 
network may include elements (such as inverter circuits and multiplexing devices) 
similar to the elements shown in FIG. 5. Each of the respective multiplexing devices 
may separately receive a similar select signal so as to provide the appropriate mode 

10 for the entire clock distribution network. 

The multiplexing device 430 may be considered part of the ring oscillator. 
The multiplexing device 430 may be a pass-through element, which makes it 
logically passive for the loop. However, the multiplexing device 430 may contribute 
to delay (and therefore period) and Vcc sensitively of the delay of the loop. 

15 FIG. 6 is a diagram showing the clock generating circuit of FIG. 5 along with 

elements to be clocked according to an example embodiment of the present 
invention. Other embodiments and configurations are also within the scope of the 
present invention. For ease of illustration, FIG. 6 only shows two clock generating 
circuits, namely a first clock generating circuit (in which the components are labeled 

20 A) and a second clock generating circuit (in which the components are labeled B). 
Both the first dock generating circuit and the second clock generating circuit may be 
similar to the clock generating circuit shown in FIG. 5 although other embodiments 
for a clock generating circuit are also within the scope of the present invention. 
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More specifically, the first clock generating circuit includes inverter circuits 
41 OA, 420A, 440A, 450A and 460A as well as a multiplexing device 430A. Similarly, 
the second clock generating circuit includes inverter circuits 41 OB, 420B, 440B, 
450B and 460B as well as a multiplexing device 430B. The first clock generating 
5 circuit may be coupled between the signal trace 312 and the signal trace 322. The 
second clock generating device may be coupled between a signal trace 314 (such 
as one of the signal traces 310) and the signal trace 322. FIG. 6 shows elements of 
the core 101 such as a D flip-flop circuit 510, logic 520 and a D flip-flop circuit 530. 
The D flip-flop circuits 510 and 530 may be clocked by the clock distribution network, 

10 such as clock signals on the signal trace 322. The logic 520 may include any type of 
latch, mechanism or state machine to perform a desired function in the core 101 . 

FIG. 7 is a diagram of a clock generating circuit according to an example 
embodiment of the present invention. Other embodiments and configurations are 
also within the scope of the present invention. In this embodiment, portions of the 

15 ring oscillator may be shared between a first clock generating circuit and a second 
clock generating circuit. More specifically, the inverter circuits 410 and 420 may be 
commonly used for both a first clock generating circuit (formed by at least the 
inverter circuits 440A, 450A and 460A) and a second clock generating circuit 
(formed by at least inverter circuits 440C, 450C and 460C). For ease of illustration, 

2 0 the multiplexing device for use in the ring oscillator is not shown in FIG. 7 although 
the multiplexing device may be provided after the inverter circuit 420 in one 
embodiment. The first clock generating circuit may output a clock signal on the 
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signal trace 312 and the second clock generating circuit may output a clock signal 
on the signal trace 312. 

FIG. 8 is a diagram of a clock generating circuit according to another example 
embodiment of the present invention. Other embodiments and configurations are 
also within the scope of the present invention. In this embodiment, elements of the 
ring oscillator may be shared between clock generating circuits. In this embodiment, 
adjacent ring oscillators may be "horizontally" coupled to share elements such as 
inverter circuits. For example, each of inverter circuits 502, 504, 506, 508 and 510 
are shared between adjacent ring oscillators. This may help reduce wire delays. 
The clock signals may be output from the last stage of each ring oscillator circuit to a 
signal trace such as the signal trace 322. In this example, inverter circuits 512, 514, 
516, 518 and 520 are the last stage of each ring oscillator. Although not shown in 
FIG, 8, multiplexing devices may be provided prior to the last inverter circuit of each 
ring oscillator circuit. 

FIG. 9 is a diagram of a clock generating circuit according to another example 
embodiment of the present invention. Other embodiments and configurations are 
also within the scope of the present invention. In this embodiment, elements of the 
ring oscillators may be shared between clock generating circuits. In this 
embodiment, adjacent ring oscillators may be "horizontally" coupled to share 
elements. Ring oscillators may also be "vertically" coupled to share elements. The 
clock signals may be output from the last stage of the ring oscillator circuits to 
different signal traces. In this example, inverter circuits 542, 544, 546 and 548 are 
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the last stage of each ring oscillator. Although not shown in FIG. 9, multiplexing 
devices may be provided prior to the last inverter circuit of each ring oscillator circuit. 

As discussed above, the synchronous mode's signal may originate from a 
phase lock loop circuit located external from the core and be distributed within the 
clock distribution network. On the other hand, the asynchronous mode's signal may 
originate from itself. In a stopped state, all the ring oscillators may be de-asserted 
by an enable signal to one of the oscillator stages, which may be a NAND gate, for 
example. When an enable input is asserted high, then oscillation may begin. The 
enable signal to all the oscillators may be asserted simultaneously to start all the 
oscillators together. 

More specifically, FIG. 10 shows a start/stop circuit according to an example 
embodiment of the present invention. Other embodiments and configurations are 
also within the scope of the present invention. The start/stop circuit operates to start 
driving the clock distribution network and to stop driving the clock distribution 
network. FIG. 10 shows a multiplexing device 610 to select between a synchronous 
mode and an asynchronous mode based on a synchronous/asynchronous select 
signal, which is also used as the select signal for the multiplexing device of each of 
the ring oscillator circuits. FIG. 10 also shows portions of the clock distribution 
network 620 (such as respective signal traces) coupled to inputs of NAND gates 
640, 650 and 660. Other inputs to the NAND gates 640, 650 and 660 may be from 
signal lines between stages of each ring oscillator circuit. The output of each of the 
NAND gates 640, 650 and 660 is to one input of the multiplexing device (such as the 
multiplexing device 430) of each ring oscillator circuit. Accordingly, the circuit shown 
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in FIG. 10 provides start/stop capabilities in the asynchronous mode. 

In the synchronous mode, the PLL may drive the clock distribution network, 
and the core clock. When the asynchronous mode is selected, then the clock 
distribution network may be driven by the asynchronous start signal (shown as 
async_start), which is initially low and thus the core dock may be stopped. The 
async_start signal may be asserted to start the asynchronous mode oscillator. 
Since this signal travels down the clock distribution network, a simultaneous start 
may occur. This may be the same for subsequent stops/starts. 

FIG. 1 1 is a graph showing the Vcc voltage signal on a die. As shown, the 
Vcc signal may vary due to reasons such as noise. The noise may impart Vcc 
modulation with peaks and valleys. FIG. 11 also shows a minimum Vcc(t) value that 
is located at a value no greater than the largest valley of the Vcc signal. In this 
arrangement, the frequency of the core would be at the lowest Vcc droop. 
Embodiments of the present invention allow the die to operate at an average Vcc(t) 
value. That is, the frequency of the core may track Vcc instantaneously. As clearly 
shown in FIG. 1 1 , the average Vcc(t) value is higher than the minimum Vcc(t) value. 
This allows the performance to be based on the average power supply level rather 
than the minimum power supply level. This also reduces the need to minimize the 
power supply droop as in disadvantageous and costly arrangements. 

Embodiments of the present invention have been described with regard to a 
method and apparatus to couple a core frequency to an instantaneous power supply 
such that the core frequency tracks the power supply (Vcc) to maintain functionality 
of the core logic in the face of severe supply noise. Embodiments of the present 
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invention may include distributed oscillators to drive a common clock distribution 
network. The distributed oscillators may filter out the uncorrelated noise and 
respond to global supply noise. Embodiments of the present invention may further 
provide an asynchronous core I/O interface flexible enough to allow a wide range of 
5 instantaneous frequency ratios between the core and the I/O. An I/O ring around 
the core may run on a phase lock loop circuit synchronizing the I/O to the external 
world and thereby presenting a synchronous interface to the outside. An internal 
core phase lock loop circuit may drive the internal core clock in a synchronous mode 
to facilitate testing. 

10 Embodiments of the present invention may provide a clock distribution 

network driven by a regular array of identical oscillators. This uniform structure may 
ensure that all the oscillators toggle simultaneously to produce a clock that reaches 
any point on the die at a coherent frequency with minimal skew between points. 
Each oscillator may include a ring oscillator made out of an odd number of inversion 

15 stages. These stages may be CMOS technology so that the elements of the ring 
oscillator track the power supply and temperature in the same fashion as the core 
datapath logic. The ring oscillator length may also be adjustable. The ring oscillator 
may be adjusted to a length that produces a period just long enough to ensure 
functionality of the worst-case core speedpath under any power supply and 

2 0 temperature condition. As Vcc-Vss increases, the worst speedpath may need less 
time to evaluate, and the oscillator frequency may proportionately increase to keep 
track. On the other hand, as Vcc-Vss decreases, the worst speedpath may require 
more evaluation time and that increases in time may be provided by the slower 
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oscillator frequency. As a result, the Instantaneous oscillator frequency may track 
the worst speedpath, thus ensuring functionality over any (voltage, temperature) 
condition. Some amount of margin may counter any locally uncorrected noise in 
voltage and temperature. 
5 One alternate mode may employ the PLL as a clock source. This allows 

testing of the microprocessor in the traditional synchronous mode, in which machine 
behavior is predictable on a cycle-by-cycle basis. 

The preceding arrangements and embodiments have been described in U.S. 
Patent Application (Attorney Docket No. 219.41237X00), filed simultaneously with 

10 this application. The following embodiments may be pursued in this patent 

application, although these embodiments are also applicable and combinable with 
the embodiments and features described above 

As discussed above, a free-running oscillator may be used to run the 
microprocessor core and may extract a maximum performance. However, a 

15 microprocessor having a rated frequency may be more marketable. Thus, while the 
instantaneous frequency may be allowed to wander while the processor tracks the 
power supply noise, a microprocessor may also have a rated average frequency that 
reflects its average performance. This may be accomplished by a voltage ID (VID) 
mechanism where commands may be sent to an external voltage regulator module 

20 (VRM) to increase or decrease the voltage level (Vcc) supplied to the processor (i.e., 
the core). The power supply may therefore be used as the control voltage. This 
may allow the processor to throttle the voltage (and therefore the frequency) 



17 



219 . 41409X00 
P14297 

automatically and in an efficient manner. The throttling mechanisms (e.g. thermal, 
performance, reliability) may be simplified to a centralized control. 

Embodiments of the present invention may allow the core operating in the 
asynchronous mode to be frequency governed to a constant average frequency and 
5 thereby run at a constant (i.e., unthrottled) performance. This may allow speed 
grading of parts during manufacturing. This may also allow continuous binning in 
which the bin size is arbitrary. This differs from disadvantageous arrangements in 
which the bin size may be limited by fixed bus ratio steps as allowed by a PLL circuit. 
Furthermore, a spread spectrum may be imparted on the oscillator by introducing a 

10 suitable modulation. Further, the power/reliability/performance may allow longer 
battery life (such as for laptops) while extending performance (such as for desktop 
computers) by converting surplus into higher frequency/voltage. 

Embodiments of the present invention may provide a frequency control circuit 
that includes a mechanism (such as a VID mechanism) to adjust a frequency of 

15 clock signals output from a plurality of distributed oscillators by using a supply 

voltage, which is the same voltage signal used to power critical components on-die. 
That is, the supply voltage (Vcc) is used as the voltage signal for the plurality of 
distributed oscillators. The frequency control circuit may control the frequency of 
circuit components that operate at a frequency dependent on a power supply voltage 

2 0 FIG. 12 shows an example of a VID mechanism according to an example 

embodied in the present invention. Other embodiments and configurations are also 
within the scope of the present invention. The VID mechanism may also be referred 
to as a VID controlled frequency (VCF) mechanism (or frequency control circuit). 
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More specifically, FIG. 12 shows a VID controller 710, a divide-by-N circuit 730 and 
a comparator circuit 740 provided within the core 101 . The VID controller 710, the 
divide-by-N circuit 730 and the comparator device 740 operate along with a voltage 
regulator module (VRM) 800 (located external to the core 101) and a plurality of 
5 distributed oscillators (VCOs) 850 to form a closed loop feedback system. The 
VCOs 850 may correspond to the plurality of ring oscillators provided about a clock 
distribution network discussed above. Each of the stages (such as the inversion 
stages) of the VCOs 850 may be CMOS technology so that the components track 
the power supply and temperature in the same fashion as the core database logic. 

10 That is, each oscillator may be directly connected to Vcc as shown in FIG. 12 (and 
through a power distribution network). Embodiments of the present invention may 
employ Vcc as a control voltage to each of the VCOs 850 via the power distribution 
network. Components other the VCOs 850 may be powered by Vcc. 

The closed loop feedback system may be obtained by the VID controller 710 

15 issuing a VID to the VRM 800, which in turn outputs the corresponding Vcc level. 
The Vcc may be translated to the correct frequency in the core clock. That is, the 
core clock may be regulated such that the average frequency is N times the external 
clock. This may be accomplished using the divide-by-N circuit 730. 

FIG. 12 shows an external clock (or system clock) may be input to the core 

20 (such as the core 101) and input to the comparator device 740 (such as a frequency 
comparator). Another input of the comparator device 740 may be a signal based on 
the core clock signals (output from the ring oscillator circuits) that pass through the 
divide-by-N circuit 730. The comparator device 740 outputs a signal to the VID 
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controller 710 based on the input signals. The VID controller issues a VID to the 
VRM 800. The VRM 800 outputs the correct Vcc level to the power distribution 
network so as to power components of the distributed ring oscillators (or other 
components) as well as to power other components (such as elements or logic) 
5 located on-die. Based on the received power signals, the VCOs 850 output core 
clock signals via the clock distribution network as discussed above. The signals may 
be input to the divide-by-N circuit 730 and pass to the comparator device 740. This 
closed loop feedback system thereby controls a frequency of the core clock signals 
on the clock distribution network by controlling the voltage to the VCO's 850. 
10 Accordingly, a frequency control circuit may monitor the frequency of the core clock 
and adjust the power supply to the VCOs 850 via the power distribution network. 
This adjusts the frequency of the core clock. As described above, each component 
of the VCOs 850 may be very sensitive to the environment such as temperature and 
voltage. 

15 The VID controller 710 may output the VID as a digital code along signal lines 

of the VRM 800. These signals may be parallel signals (such as five bits, for 
example). The signals may also be serially sent from the VID controller 710 to the 
VRM 800. Serial VID is described in U.S. Patent Application No. 09/895,062 filed 
June 29, 2001. The VID controller 710 may be a digital circuit on-die that decides 

2 0 the VID code to drive the VRM 800. A VID table may specify the digital code versus 
the analog Vcc voltage to the supplied VRM 800. The processor may decide which 
VID to send to the VRM 800. 
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Embodiments of the present invention may also include a temperature sensor 
provided about the VCOs 850 (or other component) to measure the operating 
temperature (Tj). The system may also be configured such that the VID controller 
710 outputs the VID to the VRM 850 based on the sensed temperature and a 
5 desired temperature. Further, the VID controller 710 may provide the VID as a 
function of the temperature and/or the frequency. Operations further relating to the 
setting of the operating frequency based on the operating temperature and an 
operating voltage are provided in U.S. Patent Application No. (Attorney Docket No. 
42390. PI 4325), filed simultaneously herewith, and entitled "Altering Operating 

10 Frequency And Voltage Set Point Of A Circuit In Response To The Operating 
Temperature And Instantaneous Operating Voltage Of The Circuit." 

According to embodiments of the present invention, the frequency control 
circuit may operate in a frequency maximizing mode and/or a power minimizing 
mode. Other modes of operation are also within the scope of the present invention. 

15 In a frequency maximizing mode, the frequency control circuit may direct the voltage 
regulator (such as the VRM 800) to increase the Vcc to a maximum allowed by the 
presently sensed temperature on the processor (such as about the VCOs 850). In a 
power minimizing mode, the frequency control circuit may reduce the Vcc as long as 
a minimum target frequency is met. This mode may utilize only enough power to 

20 attain the target frequency. The reduced Vcc will conserve power. 

Embodiments of the present invention may also include a software selectable 
mode switch. The operating system may decide when performance or power is 
more important and operate the switch to the respective mode. The operating 
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system may decide to go into low power mode, and also to change the target 
frequency, thereby allowing an even lower Vcc, Fmax and power. 

Embodiments of the present invention may also include software selectable 
speed targets. A speed target charges the frequency that a component (such as a 
5 processor) was originated for. There may be programmable limits on those speed 
targets. For example, speed targets such as 2x, 1x, 0.75x, and 0.5x may be 
available. That is, if a component was a 5GHz component, then a 2x target implies 
10GHz. The component may deliver the highest frequency it is able to given the 
present temperature (and derivatively, the present Vcc). A .5x speed target may 

10 cause a 5GHz component to attempt to slow down to 2.5 GHz. The component may 
be unable to slow down this much by dropping to its minimum operating voltage. 
However, the component may go to an extremely low power condition. Other speed 
targets are also within the scope of the present invention. 

FIG. 13 is a graph showing performance/power/reliability autothrottling 

15 according to an example embodiment of the present invention. Other embodiments 
are also within the scope of the present invention. More specifically, the graph 
shows how the Vcc and the temperature Tj(t) vary over time based on a load where 
the load current is proportional to activity in the processor. The Vcc(t) may be 
thermally limited near Vccmax. This graph shows that the Vcc and Tj may increase 

2 0 based on a load (or activity in the processor). 

More specifically, FIG. 13 shows the ability of the processor to reach to load 
demands using temperature as a secondary control parameter. That is, as demand 
increases, activity increases, and silicon junction temperature (Tj) increases, which 
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slows down the oscillators. The VFC loop may then force Vcc higher, which in turn 
increases Tj, thereby forcing Vcc higher, and Tj even higher until a maximum point is 
reached. The maximum point may be defined by an acceptable (Vcc, Tj) pair. The 
maximum point may represent the maximum performance that can be reliably and 
5 consistently extracted from the processor. 

On the other hand, as the demand decreases, the VFC loop may quickly 
settle toward Vccmin, As Tj cools further, Vcc cannot be reduced below Vccmin 
anymore, and the Vcc loop will naturally force a higher core frequency. There are 
three options at this point. First, a higher core frequency may be allowed. The 

10 system may relay on Tj and core frequency to equilibrate while keeping 
Vcc=Vccmin. This option may result in a higher core frequency while 
activity/demand is low. As a second option, if low background performance is 
acceptable, then the ratio N may be reduced iteratively to reach a lower core 
frequency, until an equilibrium is reached whereby Vcc is slightly above Vccmin. As 

1 5 a third option, if an idle state is desired to reduce power further, then an override 
mechanism may disable the VFC loop and go into a sleep (i.e., stop clock) state. 

Advantages of embodiments of the present Invention will now be described. 
At the highest operating point, Vcc may be at its maximum, and correspond to the 
hottest junction temperature. This operating point may be limited by reliability. 

2 0 Rated performance may need to be guaranteed at this highest operating point. This 
maximum point may be dictated by maximum performance demand (or load 
current). As performance demand (or load current) decreases, the junction 
temperature cools. This may bring a temporary increase in the core frequency, 
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which is counteracted by negative feedback to bring about a decrease in Vcc, thus 
maintaining constant average frequency. As performance demand decreases 
further, Vcc and temperature reduce hand in hand until the lowest load current state 
is reached. 

Thus, negative feedback continually forces a Vcc, which is always just 
sufficient to sustain the rated core frequency, to a Vcc that is not too high 
(inefficiency) or too low (speedpath failure) for any valid temperature condition. 
There may be a rapid transfer from a highest performance point to a lowest 
performance point, thus conferring rapid energy delivery-on-demand (or energy 
conservation-on-demand) along an efficient transition curve. This efficiency may 
translate to a large increase in battery life for laptop or portable computers, for 
example. 

In order to lock to a desired bus-to-core frequency ratio, the digital voltage 
frequency control (VFC) machine may count the internal core clocks and attempt to 
guide the internal core frequency to a long-term lock point that is a pre-determined 
ratio with respect to the external frequency. At lock, the VID may vary between two 
or more code values. This feedback may be performed with a slow time constant 
such that power supply droops (due to noise) are tracked by the oscillators. For 
example, quick first droops in the nanosecond scale may be addressed through 
instantaneous frequency tracking of the oscillators. This tracking may allow the core 
frequency to rise and fall in concert with supply noise, thus making the processor 
noise-tolerant, as its frequency is not limited by the lowest instantaneous Vcc level. 
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The bus-to-core frequency ratio may be very arbitrary. In PLL-based 
synchronous core designs according to disadvantageous arrangements, the ratio 
may be an integer, or a simple low integer fraction. According to an embodiment of 
the present invention, the ratio may take on any positive real number (even less than 
1). This flexibility may enhance marketability. 

A voltage monitor may ensure reliable operation at both the high and low 
ends. The highest operating point may be limited by reliability, which can be 
computed based on junction temperature, frequency, and voltage profiles. The 
average frequency may be known, the junction temperature may be reported from 
an on-die temperature sensor, and the voltage may be detected on-die (i.e., by a 
bandgap reference). By employing a digital calculator or the like, the optimal 
maximum supply voltage point may be extracted without sacrificing performance 
margin through conservative guardbanding. In other words, by computing the real- 
time usage of the part, taking into account derating due to non-1 00% loading, the 
maximum Vcc may be pushed higher to use up the reliability margin, thus converting 
it to more performance. 

At the lowest operating point, the minimum Vcc may be usually determined to 
be the point at which functionality cannot be guaranteed due to finite headroom of 
certain circuits or increased variability of circuit performance. This Vcc minimum 
cutoff point may be used to force the core frequency to zero, thus entering a deep 
sleep powerdown state. 

The voltage frequency control scheme may allow for deterministic power 
throttling states where the core frequency is throttled down from N to M, where M is 
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lower than N, and is a fixed proportion of N. M can be deterministicaily picked such 
that the performance of the processor is not too much lower than its nominal ratio N 
such that it encroaches Into the next lower frequency bar. 

Embodiments of the present Invention have been described with respect to a 
clock generating circuit, a clock distribution network and a power distribution 
network. The clock distribution network is intended to include clock distribution grids 
and clock distribution trees and their equivalence. The clock generating circuit 
and/or power distribution network may be provided in areas (other than the core) 
that include any type of clock distribution network and/or power distribution network. 

Any reference in this specification to "one embodiment", "an embodiment", 
"example embodiment", etc., means that a particular feature, structure, or 
characteristic described in connection with the embodiment Is included in at least 
one embodiment of the invention. The appearances of such phrases in various 
places in the specification are not necessarily all referring to the same embodiment. 
Further, when a particular feature, structure, or characteristic is described in 
connection with any embodiment, it is submitted that it is within the purview of one 
skilled in the art to effect such feature, structure, or characteristic in connection with 
other ones of the embodiments. Furthermore, for ease of understanding, certain 
method procedures may have been delineated as separate procedures; however, 
these separately delineated procedures should not be construed as necessarily 
order dependent in their performance, i.e., some procedures may be able to be 
performed in an alternative ordering, simultaneously, etc. 

Although embodiments of the present invention have been described with 
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reference to a number of illustrative embodiments thereof, it should be understood 
that numerous other modifications and embodiments can be devised by those skilled 
in the art that will fall within the spirit and scope of the principles of this invention. 
More particularly, reasonable variations and modifications are possible in the 
5 component parts and/or arrangements of the subject combination arrangement 
within the scope of the foregoing disclosure, the drawings and the appended claims 
without departing from the spirit of the invention. In addition to variations and 
modifications in the component parts and/or arrangements, alternative uses will also 
be apparent to those skilled in the art. 

.0 

What is claimed is: 
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